Overview

Dataset statistics

Number of variables45
Number of observations123504
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory72.6 MiB
Average record size in memory616.0 B

Variable types

NUM25
BOOL13
CAT7

Reproduction

Analysis started2022-02-12 13:55:53.881280
Analysis finished2022-02-12 13:59:30.026380
Duration3 minutes and 36.15 seconds
Versionpandas-profiling v2.7.1
Command linepandas_profiling --config_file config.yaml [YOUR_FILE.csv]
Download configurationconfig.yaml
Snapshotdate has a high cardinality: 79 distinct values High cardinality
PeriodDate has a high cardinality: 79 distinct values High cardinality
MacroMergeKey has a high cardinality: 79 distinct values High cardinality
RepDate has a high cardinality: 79 distinct values High cardinality
unemployment_lag8 is highly correlated with unemployment_lag6High correlation
unemployment_lag6 is highly correlated with unemployment_lag8High correlation
house_prices_all is highly correlated with house_purchase_pricesHigh correlation
house_purchase_prices is highly correlated with house_prices_allHigh correlation
real_gdp is highly correlated with nominal_gdp_lag8High correlation
nominal_gdp_lag8 is highly correlated with real_gdpHigh correlation
Quarter_Period is highly correlated with PeriodDate and 1 other fieldsHigh correlation
PeriodDate is highly correlated with Quarter_Period and 1 other fieldsHigh correlation
MacroMergeKey is highly correlated with PeriodDate and 1 other fieldsHigh correlation
RepDate is highly correlated with SnapshotdateHigh correlation
Snapshotdate is highly correlated with RepDateHigh correlation
foliolossrate is highly skewed (γ1 = 24.89867116) Skewed
foliolossrateLag1 is highly skewed (γ1 = 24.97215878) Skewed
foliolossrateLag2 is highly skewed (γ1 = 25.04617602) Skewed
foliolossrateLag3 is highly skewed (γ1 = 25.05261385) Skewed
foliolossrateLag4 is highly skewed (γ1 = 25.12720892) Skewed
foliolossrate has 88260 (71.5%) zeros Zeros
foliolossrateLag1 has 88291 (71.5%) zeros Zeros
foliolossrateLag2 has 88322 (71.5%) zeros Zeros
foliolossrateLag3 has 88348 (71.5%) zeros Zeros
foliolossrateLag4 has 88379 (71.6%) zeros Zeros
MovingAverage has 48089 (38.9%) zeros Zeros

Variables

FDICCert
Real number (ℝ≥0)

Distinct count154
Unique (%)0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean314.88355033035367
Minimum35
Maximum847
Zeros0
Zeros (%)0.0%
Memory size482.6 KiB

Quantile statistics

Minimum35
5-th percentile51
Q1170
median284
Q3420
95-th percentile737
Maximum847
Range812
Interquartile range (IQR)250

Descriptive statistics

Standard deviation201.3433469
Coefficient of variation (CV)0.6394216105
Kurtosis0.09293437099
Mean314.8835503
Median Absolute Deviation (MAD)130
Skewness0.8140019517
Sum38889378
Variance40539.14333
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
256 882 0.7%
 
231 882 0.7%
 
428 882 0.7%
 
172 882 0.7%
 
363 882 0.7%
 
235 882 0.7%
 
107 882 0.7%
 
618 882 0.7%
 
234 882 0.7%
 
170 882 0.7%
 
Other values (144) 114684 92.9%
 
ValueCountFrequency (%) 
35 882 0.7%
 
39 882 0.7%
 
41 882 0.7%
 
46 882 0.7%
 
47 834 0.7%
 
ValueCountFrequency (%) 
847 882 0.7%
 
845 882 0.7%
 
829 810 0.7%
 
823 882 0.7%
 
769 882 0.7%
 

Snapshotdate
Categorical

HIGH CARDINALITY
HIGH CORRELATION
Distinct count79
Unique (%)0.1%
Missing0
Missing (%)0.0%
Memory size965.0 KiB
12/31/2002
 
1848
6/30/2006
 
1848
6/30/2003
 
1848
12/31/2005
 
1848
12/31/2003
 
1848
Other values (74)
114264
ValueCountFrequency (%) 
12/31/2002 1848 1.5%
 
6/30/2006 1848 1.5%
 
6/30/2003 1848 1.5%
 
12/31/2005 1848 1.5%
 
12/31/2003 1848 1.5%
 
6/30/2002 1848 1.5%
 
3/31/2005 1848 1.5%
 
3/31/2006 1848 1.5%
 
9/30/2004 1848 1.5%
 
12/31/2001 1848 1.5%
 
Other values (69) 105024 85.0%
 

Length

Max length10
Mean length9.251870385
Min length9
ValueCountFrequency (%) 
Decimal_Number 10 90.9%
 
Other_Punctuation 1 9.1%
 
ValueCountFrequency (%) 
Common 11 100.0%
 
ValueCountFrequency (%) 
ASCII 11 100.0%
 

PeriodDate
Categorical

HIGH CARDINALITY
HIGH CORRELATION
Distinct count79
Unique (%)0.1%
Missing0
Missing (%)0.0%
Memory size965.0 KiB
3/31/2006
 
1848
12/31/2007
 
1848
6/30/2005
 
1848
3/31/2008
 
1848
9/30/2007
 
1848
Other values (74)
114264
ValueCountFrequency (%) 
3/31/2006 1848 1.5%
 
12/31/2007 1848 1.5%
 
6/30/2005 1848 1.5%
 
3/31/2008 1848 1.5%
 
9/30/2007 1848 1.5%
 
6/30/2006 1848 1.5%
 
9/30/2004 1848 1.5%
 
9/30/2005 1848 1.5%
 
3/31/2005 1848 1.5%
 
6/30/2007 1848 1.5%
 
Other values (69) 105024 85.0%
 

Length

Max length10
Mean length9.248591139
Min length9
ValueCountFrequency (%) 
Decimal_Number 10 90.9%
 
Other_Punctuation 1 9.1%
 
ValueCountFrequency (%) 
Common 11 100.0%
 
ValueCountFrequency (%) 
ASCII 11 100.0%
 

Period
Categorical

Distinct count12
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size965.0 KiB
P1
 
11139
P2
 
10985
P3
 
10831
P4
 
10677
P5
 
10523
Other values (7)
69349
ValueCountFrequency (%) 
P1 11139 9.0%
 
P2 10985 8.9%
 
P3 10831 8.8%
 
P4 10677 8.6%
 
P5 10523 8.5%
 
P6 10369 8.4%
 
P7 10215 8.3%
 
P8 10061 8.1%
 
P9 9907 8.0%
 
P10 9753 7.9%
 
Other values (2) 19044 15.4%
 

Length

Max length3
Mean length2.233166537
Min length2
ValueCountFrequency (%) 
Decimal_Number 10 90.9%
 
Uppercase_Letter 1 9.1%
 
ValueCountFrequency (%) 
Common 10 90.9%
 
Latin 1 9.1%
 
ValueCountFrequency (%) 
ASCII 11 100.0%
 

foliolossrate
Real number (ℝ≥0)

SKEWED
ZEROS
Distinct count394
Unique (%)0.3%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean0.002085827179686488
Minimum0.0
Maximum0.6563
Zeros88260
Zeros (%)71.5%
Memory size965.0 KiB

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q30.0003
95-th percentile0.0094
Maximum0.6563
Range0.6563
Interquartile range (IQR)0.0003

Descriptive statistics

Standard deviation0.01241149338
Coefficient of variation (CV)5.950393925
Kurtosis985.8648979
Mean0.00208582718
Median Absolute Deviation (MAD)0
Skewness24.89867116
Sum257.608
Variance0.0001540451679
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
0 88260 71.5%
 
0.0001 2118 1.7%
 
0.0002 1805 1.5%
 
0.0003 1599 1.3%
 
0.0004 1542 1.2%
 
0.0005 1190 1.0%
 
0.0006 1003 0.8%
 
0.0007 912 0.7%
 
0.0008 907 0.7%
 
0.0009 788 0.6%
 
Other values (384) 23380 18.9%
 
ValueCountFrequency (%) 
0 88260 71.5%
 
0.0001 2118 1.7%
 
0.0002 1805 1.5%
 
0.0003 1599 1.3%
 
0.0004 1542 1.2%
 
ValueCountFrequency (%) 
0.6563 12 < 0.1%
 
0.4094 12 < 0.1%
 
0.3502 1 < 0.1%
 
0.3365 12 < 0.1%
 
0.225 12 < 0.1%
 

TotalAssets
Real number (ℝ≥0)

Distinct count10957
Unique (%)8.9%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean219956.99324718228
Minimum12888
Maximum3291676
Zeros0
Zeros (%)0.0%
Memory size482.6 KiB

Quantile statistics

Minimum12888
5-th percentile32492
Q166656
median124593
Q3237428
95-th percentile776790
Maximum3291676
Range3278788
Interquartile range (IQR)170772

Descriptive statistics

Standard deviation292515.3581
Coefficient of variation (CV)1.329875235
Kurtosis21.55434127
Mean219956.9932
Median Absolute Deviation (MAD)68531
Skewness3.87835666
Sum2.716556849e+10
Variance8.556523474e+10
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
109916 36 < 0.1%
 
38734 36 < 0.1%
 
75318 36 < 0.1%
 
57709 35 < 0.1%
 
64083 28 < 0.1%
 
143454 24 < 0.1%
 
192969 24 < 0.1%
 
33588 24 < 0.1%
 
38533 24 < 0.1%
 
77826 24 < 0.1%
 
Other values (10947) 123213 99.8%
 
ValueCountFrequency (%) 
12888 12 < 0.1%
 
13265 12 < 0.1%
 
13628 12 < 0.1%
 
13772 12 < 0.1%
 
13943 12 < 0.1%
 
ValueCountFrequency (%) 
3291676 7 < 0.1%
 
3279239 5 < 0.1%
 
3224360 1 < 0.1%
 
3204143 4 < 0.1%
 
3191429 9 < 0.1%
 

folioloan
Real number (ℝ≥0)

Distinct count9114
Unique (%)7.4%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean17339.178010428812
Minimum252
Maximum667768
Zeros0
Zeros (%)0.0%
Memory size482.6 KiB

Quantile statistics

Minimum252
5-th percentile1406
Q14008
median8709
Q318500.5
95-th percentile62136
Maximum667768
Range667516
Interquartile range (IQR)14492.5

Descriptive statistics

Standard deviation29116.91673
Coefficient of variation (CV)1.679255886
Kurtosis60.61486003
Mean17339.17801
Median Absolute Deviation (MAD)5830
Skewness6.135683878
Sum2141457841
Variance847794839.7
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
2873 72 0.1%
 
2042 60 < 0.1%
 
4008 60 < 0.1%
 
2063 60 < 0.1%
 
2646 58 < 0.1%
 
2898 56 < 0.1%
 
1617 55 < 0.1%
 
2836 49 < 0.1%
 
3548 48 < 0.1%
 
8352 48 < 0.1%
 
Other values (9104) 122938 99.5%
 
ValueCountFrequency (%) 
252 12 < 0.1%
 
261 12 < 0.1%
 
270 12 < 0.1%
 
278 12 < 0.1%
 
300 12 < 0.1%
 
ValueCountFrequency (%) 
667768 1 < 0.1%
 
619449 3 < 0.1%
 
597301 4 < 0.1%
 
580734 2 < 0.1%
 
449812 4 < 0.1%
 

State
Categorical

Distinct count21
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size965.0 KiB
IA
17358
KY
14406
AL
13296
GA
12414
OK
11604
Other values (16)
54426
ValueCountFrequency (%) 
IA 17358 14.1%
 
KY 14406 11.7%
 
AL 13296 10.8%
 
GA 12414 10.1%
 
OK 11604 9.4%
 
AR 10818 8.8%
 
SD 6432 5.2%
 
MT 5442 4.4%
 
OH 5292 4.3%
 
PA 4920 4.0%
 
Other values (11) 21522 17.4%
 

Length

Max length2
Mean length2
Min length2
ValueCountFrequency (%) 
Uppercase_Letter 20 100.0%
 
ValueCountFrequency (%) 
Latin 20 100.0%
 
ValueCountFrequency (%) 
ASCII 20 100.0%
 

Quarter_Period
Categorical

HIGH CORRELATION
Distinct count4
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size482.6 KiB
2
31398
1
31044
4
30702
3
30360
ValueCountFrequency (%) 
2 31398 25.4%
 
1 31044 25.1%
 
4 30702 24.9%
 
3 30360 24.6%
 

Length

Max length1
Mean length1
Min length1
ValueCountFrequency (%) 
Decimal_Number 4 100.0%
 
ValueCountFrequency (%) 
Common 4 100.0%
 
ValueCountFrequency (%) 
ASCII 4 100.0%
 

Year_Period
Real number (ℝ≥0)

Distinct count21
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2011.2941281254048
Minimum2001
Maximum2021
Zeros0
Zeros (%)0.0%
Memory size482.6 KiB

Quantile statistics

Minimum2001
5-th percentile2003
Q12007
median2011
Q32016
95-th percentile2020
Maximum2021
Range20
Interquartile range (IQR)9

Descriptive statistics

Standard deviation5.275915789
Coefficient of variation (CV)0.002623144828
Kurtosis-1.120449672
Mean2011.294128
Median Absolute Deviation (MAD)4
Skewness0.09573841598
Sum248402870
Variance27.83528742
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
2007 7392 6.0%
 
2008 7392 6.0%
 
2005 7392 6.0%
 
2006 7392 6.0%
 
2009 7332 5.9%
 
2010 7152 5.8%
 
2011 6972 5.6%
 
2004 6930 5.6%
 
2012 6828 5.5%
 
2013 6660 5.4%
 
Other values (11) 52062 42.2%
 
ValueCountFrequency (%) 
2001 154 0.1%
 
2002 2156 1.7%
 
2003 4620 3.7%
 
2004 6930 5.6%
 
2005 7392 6.0%
 
ValueCountFrequency (%) 
2021 2724 2.2%
 
2020 5520 4.5%
 
2019 5820 4.7%
 
2018 5964 4.8%
 
2017 6084 4.9%
 

MacroMergeKey
Categorical

HIGH CARDINALITY
HIGH CORRELATION
Distinct count79
Unique (%)0.1%
Missing0
Missing (%)0.0%
Memory size965.0 KiB
2008Q1
 
1848
2006Q4
 
1848
2005Q1
 
1848
2007Q4
 
1848
2006Q2
 
1848
Other values (74)
114264
ValueCountFrequency (%) 
2008Q1 1848 1.5%
 
2006Q4 1848 1.5%
 
2005Q1 1848 1.5%
 
2007Q4 1848 1.5%
 
2006Q2 1848 1.5%
 
2007Q2 1848 1.5%
 
2005Q4 1848 1.5%
 
2007Q3 1848 1.5%
 
2004Q4 1848 1.5%
 
2008Q4 1848 1.5%
 
Other values (69) 105024 85.0%
 

Length

Max length6
Mean length6
Min length6
ValueCountFrequency (%) 
Decimal_Number 10 90.9%
 
Uppercase_Letter 1 9.1%
 
ValueCountFrequency (%) 
Common 10 90.9%
 
Latin 1 9.1%
 
ValueCountFrequency (%) 
ASCII 11 100.0%
 

foliolossrateLag1
Real number (ℝ≥0)

SKEWED
ZEROS
Distinct count394
Unique (%)0.3%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean0.002081830547998445
Minimum0.0
Maximum0.6563
Zeros88291
Zeros (%)71.5%
Memory size965.0 KiB

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q30.0003
95-th percentile0.0094
Maximum0.6563
Range0.6563
Interquartile range (IQR)0.0003

Descriptive statistics

Standard deviation0.01239093442
Coefficient of variation (CV)5.951941879
Kurtosis991.5973805
Mean0.002081830548
Median Absolute Deviation (MAD)0
Skewness24.97215878
Sum257.1144
Variance0.0001535352559
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
0 88291 71.5%
 
0.0001 2115 1.7%
 
0.0002 1805 1.5%
 
0.0003 1599 1.3%
 
0.0004 1540 1.2%
 
0.0005 1188 1.0%
 
0.0006 1003 0.8%
 
0.0007 911 0.7%
 
0.0008 906 0.7%
 
0.0009 788 0.6%
 
Other values (384) 23358 18.9%
 
ValueCountFrequency (%) 
0 88291 71.5%
 
0.0001 2115 1.7%
 
0.0002 1805 1.5%
 
0.0003 1599 1.3%
 
0.0004 1540 1.2%
 
ValueCountFrequency (%) 
0.6563 12 < 0.1%
 
0.4094 12 < 0.1%
 
0.3502 1 < 0.1%
 
0.3365 12 < 0.1%
 
0.225 11 < 0.1%
 

foliolossrateLag2
Real number (ℝ≥0)

SKEWED
ZEROS
Distinct count394
Unique (%)0.3%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean0.002077833916310403
Minimum0.0
Maximum0.6563
Zeros88322
Zeros (%)71.5%
Memory size965.0 KiB

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q30.0003
95-th percentile0.0094
Maximum0.6563
Range0.6563
Interquartile range (IQR)0.0003

Descriptive statistics

Standard deviation0.01237034001
Coefficient of variation (CV)5.953478722
Kurtosis997.3847226
Mean0.002077833916
Median Absolute Deviation (MAD)0
Skewness25.04617602
Sum256.6208
Variance0.0001530253119
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
0 88322 71.5%
 
0.0001 2112 1.7%
 
0.0002 1805 1.5%
 
0.0003 1599 1.3%
 
0.0004 1538 1.2%
 
0.0005 1186 1.0%
 
0.0006 1003 0.8%
 
0.0007 910 0.7%
 
0.0008 905 0.7%
 
0.0009 788 0.6%
 
Other values (384) 23336 18.9%
 
ValueCountFrequency (%) 
0 88322 71.5%
 
0.0001 2112 1.7%
 
0.0002 1805 1.5%
 
0.0003 1599 1.3%
 
0.0004 1538 1.2%
 
ValueCountFrequency (%) 
0.6563 12 < 0.1%
 
0.4094 12 < 0.1%
 
0.3502 1 < 0.1%
 
0.3365 12 < 0.1%
 
0.225 10 < 0.1%
 

foliolossrateLag3
Real number (ℝ≥0)

SKEWED
ZEROS
Distinct count394
Unique (%)0.3%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean0.0020764566329835475
Minimum0.0
Maximum0.6563
Zeros88348
Zeros (%)71.5%
Memory size965.0 KiB

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q30.0003
95-th percentile0.0094
Maximum0.6563
Range0.6563
Interquartile range (IQR)0.0003

Descriptive statistics

Standard deviation0.01236925562
Coefficient of variation (CV)5.956905346
Kurtosis997.7453156
Mean0.002076456633
Median Absolute Deviation (MAD)0
Skewness25.05261385
Sum256.4507
Variance0.0001529984845
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
0 88348 71.5%
 
0.0001 2111 1.7%
 
0.0002 1804 1.5%
 
0.0003 1597 1.3%
 
0.0004 1537 1.2%
 
0.0005 1185 1.0%
 
0.0006 1003 0.8%
 
0.0007 908 0.7%
 
0.0008 905 0.7%
 
0.0009 788 0.6%
 
Other values (384) 23318 18.9%
 
ValueCountFrequency (%) 
0 88348 71.5%
 
0.0001 2111 1.7%
 
0.0002 1804 1.5%
 
0.0003 1597 1.3%
 
0.0004 1537 1.2%
 
ValueCountFrequency (%) 
0.6563 12 < 0.1%
 
0.4094 12 < 0.1%
 
0.3502 1 < 0.1%
 
0.3365 12 < 0.1%
 
0.225 10 < 0.1%
 

foliolossrateLag4
Real number (ℝ≥0)

SKEWED
ZEROS
Distinct count394
Unique (%)0.3%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean0.002072460001295505
Minimum0.0
Maximum0.6563
Zeros88379
Zeros (%)71.6%
Memory size965.0 KiB

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q30.0003
95-th percentile0.0093
Maximum0.6563
Range0.6563
Interquartile range (IQR)0.0003

Descriptive statistics

Standard deviation0.01234862331
Coefficient of variation (CV)5.958437461
Kurtosis1003.591655
Mean0.002072460001
Median Absolute Deviation (MAD)0
Skewness25.12720892
Sum255.9571
Variance0.0001524884976
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
0 88379 71.6%
 
0.0001 2108 1.7%
 
0.0002 1804 1.5%
 
0.0003 1597 1.3%
 
0.0004 1535 1.2%
 
0.0005 1183 1.0%
 
0.0006 1003 0.8%
 
0.0007 907 0.7%
 
0.0008 904 0.7%
 
0.0009 788 0.6%
 
Other values (384) 23296 18.9%
 
ValueCountFrequency (%) 
0 88379 71.6%
 
0.0001 2108 1.7%
 
0.0002 1804 1.5%
 
0.0003 1597 1.3%
 
0.0004 1535 1.2%
 
ValueCountFrequency (%) 
0.6563 12 < 0.1%
 
0.4094 12 < 0.1%
 
0.3502 1 < 0.1%
 
0.3365 12 < 0.1%
 
0.225 9 < 0.1%
 

unemployment
Real number (ℝ≥0)

Distinct count1522
Unique (%)1.2%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean5.694983491615154
Minimum2.495590993
Maximum15.1723584
Zeros0
Zeros (%)0.0%
Memory size965.0 KiB

Quantile statistics

Minimum2.495590993
5-th percentile3.185068551
Q14.278082134
median5.257924334
Q36.610215622
95-th percentile10.04018124
Maximum15.1723584
Range12.67676741
Interquartile range (IQR)2.332133488

Descriptive statistics

Standard deviation2.010227772
Coefficient of variation (CV)0.3529821948
Kurtosis0.8155946679
Mean5.694983492
Median Absolute Deviation (MAD)1.092818065
Skewness1.04890996
Sum703353.2411
Variance4.041015697
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
3.7277113 252 0.2%
 
4.067411187 252 0.2%
 
6.227353369 252 0.2%
 
3.602266353 252 0.2%
 
6.023222761 252 0.2%
 
4.421010273 252 0.2%
 
3.707849761 252 0.2%
 
3.683188088 252 0.2%
 
3.918197886 252 0.2%
 
4.558346889 252 0.2%
 
Other values (1512) 120984 98.0%
 
ValueCountFrequency (%) 
2.495590993 216 0.2%
 
2.542976405 216 0.2%
 
2.600537191 96 0.1%
 
2.626554199 216 0.2%
 
2.634195934 24 < 0.1%
 
ValueCountFrequency (%) 
15.1723584 48 < 0.1%
 
14.26063623 60 < 0.1%
 
14.1217988 72 0.1%
 
13.68076803 48 < 0.1%
 
13.57686843 24 < 0.1%
 

unemployment_lag1
Real number (ℝ≥0)

Distinct count1522
Unique (%)1.2%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean5.683969023218494
Minimum2.495590993
Maximum15.1723584
Zeros0
Zeros (%)0.0%
Memory size965.0 KiB

Quantile statistics

Minimum2.495590993
5-th percentile3.185068551
Q14.261845293
median5.257924334
Q36.549145166
95-th percentile10.03169877
Maximum15.1723584
Range12.67676741
Interquartile range (IQR)2.287299873

Descriptive statistics

Standard deviation2.003547698
Coefficient of variation (CV)0.3524909601
Kurtosis0.8595600808
Mean5.683969023
Median Absolute Deviation (MAD)1.082409797
Skewness1.059690138
Sum701992.9102
Variance4.014203378
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
3.67724837 252 0.2%
 
3.918197886 252 0.2%
 
3.751768634 252 0.2%
 
4.067411187 252 0.2%
 
6.023222761 252 0.2%
 
4.558346889 252 0.2%
 
3.707849761 252 0.2%
 
6.610215622 252 0.2%
 
3.678512944 252 0.2%
 
5.594452881 252 0.2%
 
Other values (1512) 120984 98.0%
 
ValueCountFrequency (%) 
2.495590993 216 0.2%
 
2.542976405 216 0.2%
 
2.600537191 96 0.1%
 
2.626554199 216 0.2%
 
2.634195934 12 < 0.1%
 
ValueCountFrequency (%) 
15.1723584 48 < 0.1%
 
14.26063623 60 < 0.1%
 
14.1217988 72 0.1%
 
13.68076803 48 < 0.1%
 
13.57686843 24 < 0.1%
 

unemployment_lag6
Real number (ℝ≥0)

HIGH CORRELATION
Distinct count1522
Unique (%)1.2%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean5.554389821407638
Minimum2.098344192
Maximum11.88844425
Zeros0
Zeros (%)0.0%
Memory size965.0 KiB

Quantile statistics

Minimum2.098344192
5-th percentile3.185068551
Q14.232770454
median5.178246795
Q36.308300289
95-th percentile9.76804495
Maximum11.88844425
Range9.790100058
Interquartile range (IQR)2.075529835

Descriptive statistics

Standard deviation1.878366188
Coefficient of variation (CV)0.3381768742
Kurtosis0.7053422372
Mean5.554389821
Median Absolute Deviation (MAD)1.002240095
Skewness1.039951821
Sum685989.3605
Variance3.528259537
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
4.60246962 252 0.2%
 
3.707849761 252 0.2%
 
4.135910835 252 0.2%
 
3.918197886 252 0.2%
 
3.751768634 252 0.2%
 
6.023222761 252 0.2%
 
6.48093256 252 0.2%
 
6.610215622 252 0.2%
 
4.558346889 252 0.2%
 
4.621560268 252 0.2%
 
Other values (1512) 120984 98.0%
 
ValueCountFrequency (%) 
2.098344192 6 < 0.1%
 
2.223178778 4 < 0.1%
 
2.318381225 2 < 0.1%
 
2.351268937 8 < 0.1%
 
2.481417151 16 < 0.1%
 
ValueCountFrequency (%) 
11.88844425 192 0.2%
 
11.58972759 192 0.2%
 
11.45527408 192 0.2%
 
11.2427755 12 < 0.1%
 
11.13359728 12 < 0.1%
 

unemployment_lag8
Real number (ℝ≥0)

HIGH CORRELATION
Distinct count1522
Unique (%)1.2%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean5.559773213987254
Minimum2.098344192
Maximum11.88844425
Zeros0
Zeros (%)0.0%
Memory size965.0 KiB

Quantile statistics

Minimum2.098344192
5-th percentile3.202344046
Q14.248327368
median5.182593357
Q36.284377349
95-th percentile9.759883387
Maximum11.88844425
Range9.790100058
Interquartile range (IQR)2.036049981

Descriptive statistics

Standard deviation1.860926263
Coefficient of variation (CV)0.3347126207
Kurtosis0.7781509119
Mean5.559773214
Median Absolute Deviation (MAD)0.971353862
Skewness1.060376059
Sum686654.231
Variance3.463046556
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
6.48093256 252 0.2%
 
3.67724837 252 0.2%
 
3.751768634 252 0.2%
 
4.558346889 252 0.2%
 
4.621560268 252 0.2%
 
4.116880643 252 0.2%
 
6.023222761 252 0.2%
 
3.7277113 252 0.2%
 
6.610215622 252 0.2%
 
4.135910835 252 0.2%
 
Other values (1512) 120984 98.0%
 
ValueCountFrequency (%) 
2.098344192 10 < 0.1%
 
2.223178778 8 < 0.1%
 
2.318381225 6 < 0.1%
 
2.351268937 24 < 0.1%
 
2.437222592 16 < 0.1%
 
ValueCountFrequency (%) 
11.88844425 192 0.2%
 
11.58972759 192 0.2%
 
11.45527408 192 0.2%
 
11.2427755 12 < 0.1%
 
11.13359728 12 < 0.1%
 

unemployment_lag2growth
Real number (ℝ)

Distinct count1522
Unique (%)1.2%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean-0.005148190227344863
Minimum-0.982071438
Maximum0.773004767
Zeros0
Zeros (%)0.0%
Memory size965.0 KiB

Quantile statistics

Minimum-0.982071438
5-th percentile-0.092725391
Q1-0.037814038
median-0.013647152
Q30.018206033
95-th percentile0.122085742
Maximum0.773004767
Range1.755076205
Interquartile range (IQR)0.056020071

Descriptive statistics

Standard deviation0.1125688204
Coefficient of variation (CV)-21.86570726
Kurtosis27.52816285
Mean-0.005148190227
Median Absolute Deviation (MAD)0.027442198
Skewness-0.02811729613
Sum-635.8220858
Variance0.01267173933
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
-0.035683094 252 0.2%
 
-0.011262968 252 0.2%
 
-0.078035296 252 0.2%
 
0.01589381 252 0.2%
 
-0.004049762 252 0.2%
 
0.028475644 252 0.2%
 
-0.018611575 252 0.2%
 
0.182961922 252 0.2%
 
-0.032728289 252 0.2%
 
-0.008976887 252 0.2%
 
Other values (1512) 120984 98.0%
 
ValueCountFrequency (%) 
-0.982071438 60 < 0.1%
 
-0.976853071 24 < 0.1%
 
-0.849671941 180 0.1%
 
-0.836172896 120 0.1%
 
-0.707853611 72 0.1%
 
ValueCountFrequency (%) 
0.773004767 24 < 0.1%
 
0.745285679 48 < 0.1%
 
0.730813227 156 0.1%
 
0.724048284 120 0.1%
 
0.700332637 204 0.2%
 

house_prices_all_change
Real number (ℝ)

Distinct count1506
Unique (%)1.2%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean1.3515546972753916
Minimum-19.3355
Maximum18.9567
Zeros0
Zeros (%)0.0%
Memory size965.0 KiB

Quantile statistics

Minimum-19.3355
5-th percentile-2.4264
Q10.3976
median1.560697
Q32.5386
95-th percentile4.1837
Maximum18.9567
Range38.2922
Interquartile range (IQR)2.141

Descriptive statistics

Standard deviation2.199480165
Coefficient of variation (CV)1.627370442
Kurtosis4.942044019
Mean1.351554697
Median Absolute Deviation (MAD)1.051803
Skewness-0.4934641769
Sum166922.4113
Variance4.837712995
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
1.2909 420 0.3%
 
2.0941 399 0.3%
 
1.7657 372 0.3%
 
0.7269 360 0.3%
 
-1.0477 276 0.2%
 
-0.9807 252 0.2%
 
0.8765 252 0.2%
 
1.2766 252 0.2%
 
2.456 252 0.2%
 
1.8731 252 0.2%
 
Other values (1496) 120417 97.5%
 
ValueCountFrequency (%) 
-19.3355 12 < 0.1%
 
-18.0192 12 < 0.1%
 
-14.1975 12 < 0.1%
 
-12.1329 12 < 0.1%
 
-12.0032 12 < 0.1%
 
ValueCountFrequency (%) 
18.9567 12 < 0.1%
 
17.4299 12 < 0.1%
 
13.3951 12 < 0.1%
 
13.1456 12 < 0.1%
 
12.5382 12 < 0.1%
 
Distinct count1522
Unique (%)1.2%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean0.0066821814225936004
Minimum-0.088409475
Maximum0.067258996
Zeros72
Zeros (%)0.1%
Memory size965.0 KiB

Quantile statistics

Minimum-0.088409475
5-th percentile-0.018841937
Q10.000879226
median0.008487691
Q30.014200481
95-th percentile0.02506578
Maximum0.067258996
Range0.155668471
Interquartile range (IQR)0.013321255

Descriptive statistics

Standard deviation0.01325160164
Coefficient of variation (CV)1.98312509
Kurtosis3.421404548
Mean0.006682181423
Median Absolute Deviation (MAD)0.006414513
Skewness-1.076939983
Sum825.2761344
Variance0.0001756049459
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
-0.00385208 252 0.2%
 
0.016895002 252 0.2%
 
0.000757537 252 0.2%
 
0.00587733 252 0.2%
 
0.004268166 252 0.2%
 
0.002351858 252 0.2%
 
0.003126121 252 0.2%
 
0.007557064 252 0.2%
 
0.009976976 252 0.2%
 
0.004129639 252 0.2%
 
Other values (1512) 120984 98.0%
 
ValueCountFrequency (%) 
-0.088409475 12 < 0.1%
 
-0.085150333 12 < 0.1%
 
-0.076952973 12 < 0.1%
 
-0.068677949 12 < 0.1%
 
-0.060490218 156 0.1%
 
ValueCountFrequency (%) 
0.067258996 24 < 0.1%
 
0.061862127 12 < 0.1%
 
0.060875029 12 < 0.1%
 
0.060117048 12 < 0.1%
 
0.052141423 12 < 0.1%
 

house_purchase_prices
Real number (ℝ≥0)

HIGH CORRELATION
Distinct count1446
Unique (%)1.2%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean207.44293692408993
Minimum129.38
Maximum453.9381743
Zeros0
Zeros (%)0.0%
Memory size965.0 KiB

Quantile statistics

Minimum129.38
5-th percentile158.74
Q1182.55
median195.12
Q3223.92
95-th percentile290.61
Maximum453.9381743
Range324.5581743
Interquartile range (IQR)41.37

Descriptive statistics

Standard deviation41.61252945
Coefficient of variation (CV)0.2005974755
Kurtosis4.910731263
Mean207.4429369
Median Absolute Deviation (MAD)18.68
Skewness1.795312408
Sum25620032.48
Variance1731.602607
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
193.62 420 0.3%
 
193.05 408 0.3%
 
194.7 408 0.3%
 
191.08 396 0.3%
 
187.36 396 0.3%
 
190.01 360 0.3%
 
177.31 348 0.3%
 
184.58 276 0.2%
 
218.96 276 0.2%
 
207.2 276 0.2%
 
Other values (1436) 119940 97.1%
 
ValueCountFrequency (%) 
129.38 6 < 0.1%
 
131.34 1 < 0.1%
 
131.93 12 < 0.1%
 
133.31 2 < 0.1%
 
134.53 18 < 0.1%
 
ValueCountFrequency (%) 
453.9381743 48 < 0.1%
 
449.4049999 48 < 0.1%
 
444.9081523 48 < 0.1%
 
440.7307527 48 < 0.1%
 
437.96 48 < 0.1%
 

house_prices_all
Real number (ℝ≥0)

HIGH CORRELATION
Distinct count1522
Unique (%)1.2%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean209.9415326029829
Minimum129.617239
Maximum429.6032958
Zeros0
Zeros (%)0.0%
Memory size965.0 KiB

Quantile statistics

Minimum129.617239
5-th percentile164.3367
Q1187.4485
median200.2834
Q3226.0938
95-th percentile293.0201
Maximum429.6032958
Range299.9860568
Interquartile range (IQR)38.6453

Descriptive statistics

Standard deviation38.51989255
Coefficient of variation (CV)0.1834791433
Kurtosis4.878775828
Mean209.9415326
Median Absolute Deviation (MAD)16.5638
Skewness1.813143952
Sum25928619.04
Variance1483.782122
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
200.1329 252 0.2%
 
203.0536 252 0.2%
 
203.905 252 0.2%
 
202.6849 252 0.2%
 
185.2876 252 0.2%
 
202.7396 252 0.2%
 
205.0316 252 0.2%
 
202.6299 252 0.2%
 
200.5655 252 0.2%
 
203.9165 252 0.2%
 
Other values (1512) 120984 98.0%
 
ValueCountFrequency (%) 
129.617239 1 < 0.1%
 
131.5009408 6 < 0.1%
 
131.9723 2 < 0.1%
 
133.6875 12 < 0.1%
 
135.2964 3 < 0.1%
 
ValueCountFrequency (%) 
429.6032958 48 < 0.1%
 
425.412164 48 < 0.1%
 
421.3011489 48 < 0.1%
 
417.508769 48 < 0.1%
 
414.414 48 < 0.1%
 

CommercialPriceNat
Real number (ℝ≥0)

Distinct count79
Unique (%)0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean217531.68148488144
Minimum139008.0
Maximum308497.0
Zeros0
Zeros (%)0.0%
Memory size965.0 KiB

Quantile statistics

Minimum139008
5-th percentile152190
Q1177845
median214377
Q3247261
95-th percentile303569.6
Maximum308497
Range169489
Interquartile range (IQR)69416

Descriptive statistics

Standard deviation47030.79894
Coefficient of variation (CV)0.2162020659
Kurtosis-0.9311868494
Mean217531.6815
Median Absolute Deviation (MAD)36319
Skewness0.3481390062
Sum2.686603279e+10
Variance2211896049
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
247035 1848 1.5%
 
178058 1848 1.5%
 
229890 1848 1.5%
 
203978 1848 1.5%
 
247261 1848 1.5%
 
174327 1848 1.5%
 
190430 1848 1.5%
 
198317 1848 1.5%
 
179040 1848 1.5%
 
220299 1848 1.5%
 
Other values (69) 105024 85.0%
 
ValueCountFrequency (%) 
139008 154 0.1%
 
139077 308 0.2%
 
139601 462 0.4%
 
141168 616 0.5%
 
144686 770 0.6%
 
ValueCountFrequency (%) 
308497 1452 1.2%
 
304171 1392 1.1%
 
304098.12 1368 1.1%
 
303707.2128 1356 1.1%
 
303569.6 1368 1.1%
 

CommercialPriceNat_lag8
Real number (ℝ≥0)

Distinct count79
Unique (%)0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean200700.50147363648
Minimum125869
Maximum300927
Zeros0
Zeros (%)0.0%
Memory size482.6 KiB

Quantile statistics

Minimum125869
5-th percentile141168
Q1166808
median196712
Q3237599
95-th percentile277817
Maximum300927
Range175058
Interquartile range (IQR)70791

Descriptive statistics

Standard deviation43308.57886
Coefficient of variation (CV)0.2157870984
Kurtosis-0.9416706528
Mean200700.5015
Median Absolute Deviation (MAD)33415
Skewness0.3113211427
Sum2.478731473e+10
Variance1875633003
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
153583 1848 1.5%
 
184598 1848 1.5%
 
147309 1848 1.5%
 
203978 1848 1.5%
 
144686 1848 1.5%
 
212035 1848 1.5%
 
163297 1848 1.5%
 
174327 1848 1.5%
 
220299 1848 1.5%
 
178058 1848 1.5%
 
Other values (69) 105024 85.0%
 
ValueCountFrequency (%) 
125869 462 0.4%
 
126885 308 0.2%
 
129783 154 0.1%
 
139008 1386 1.1%
 
139062 616 0.5%
 
ValueCountFrequency (%) 
300927 1356 1.1%
 
287370 1368 1.1%
 
285944 1392 1.1%
 
278999 1368 1.1%
 
277817 1368 1.1%
 

nominal_gdp_lag8
Real number (ℝ≥0)

HIGH CORRELATION
Distinct count1522
Unique (%)1.2%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean60765.90104343834
Minimum3952.087693
Maximum432012.5597
Zeros0
Zeros (%)0.0%
Memory size965.0 KiB

Quantile statistics

Minimum3952.087693
5-th percentile9063.063532
Q127203.89148
median40863.11718
Q371132.62112
95-th percentile168661.2564
Maximum432012.5597
Range428060.472
Interquartile range (IQR)43928.72964

Descriptive statistics

Standard deviation61297.14257
Coefficient of variation (CV)1.008742428
Kurtosis9.399455283
Mean60765.90104
Median Absolute Deviation (MAD)15420.14964
Skewness2.714705464
Sum7504831842
Variance3757339687
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
28939.62637 252 0.2%
 
24951.58839 252 0.2%
 
34010.01956 252 0.2%
 
34533.37345 252 0.2%
 
31556.07795 252 0.2%
 
33214.87903 252 0.2%
 
27300.14666 252 0.2%
 
34371.77672 252 0.2%
 
25446.96414 252 0.2%
 
30239.52193 252 0.2%
 
Other values (1512) 120984 98.0%
 
ValueCountFrequency (%) 
3952.087693 2 < 0.1%
 
4153.654387 4 < 0.1%
 
4238.742412 6 < 0.1%
 
4296.262941 8 < 0.1%
 
4434.435409 10 < 0.1%
 
ValueCountFrequency (%) 
432012.5597 48 < 0.1%
 
427403.5079 48 < 0.1%
 
421128.1628 48 < 0.1%
 
420487.2223 48 < 0.1%
 
418144.2897 48 < 0.1%
 
Distinct count1522
Unique (%)1.2%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean493.13072093102244
Minimum-9426.3497
Maximum11713.7075
Zeros0
Zeros (%)0.0%
Memory size965.0 KiB

Quantile statistics

Minimum-9426.3497
5-th percentile-425.8207
Q1120.88719
median345.12601
Q3702.90466
95-th percentile1880.72136
Maximum11713.7075
Range21140.0572
Interquartile range (IQR)582.01747

Descriptive statistics

Standard deviation985.6445106
Coefficient of variation (CV)1.99874895
Kurtosis30.37445955
Mean493.1307209
Median Absolute Deviation (MAD)262.71462
Skewness2.343604047
Sum60903616.56
Variance971495.1012
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
834.8121 252 0.2%
 
702.90466 252 0.2%
 
169.18178 252 0.2%
 
743.43235 252 0.2%
 
306.63964 252 0.2%
 
-603.27642 252 0.2%
 
-293.39918 252 0.2%
 
145.13942 252 0.2%
 
812.00485 252 0.2%
 
333.46371 252 0.2%
 
Other values (1512) 120984 98.0%
 
ValueCountFrequency (%) 
-9426.3497 48 < 0.1%
 
-6492.8349 12 < 0.1%
 
-5920.1985 72 0.1%
 
-4813.3587 12 < 0.1%
 
-4520.4556 12 < 0.1%
 
ValueCountFrequency (%) 
11713.7075 48 < 0.1%
 
10759.2884 48 < 0.1%
 
10568.9135 72 0.1%
 
9385.1367 48 < 0.1%
 
7705.2223 48 < 0.1%
 
Distinct count1522
Unique (%)1.2%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean295.9411139869721
Minimum-12349.038
Maximum22384.4284
Zeros0
Zeros (%)0.0%
Memory size965.0 KiB

Quantile statistics

Minimum-12349.038
5-th percentile-673.5872
Q1-1.48357
median178.34725
Q3472.85152
95-th percentile1458.5331
Maximum22384.4284
Range34733.4664
Interquartile range (IQR)474.33509

Descriptive statistics

Standard deviation1165.230815
Coefficient of variation (CV)3.937373891
Kurtosis102.8879978
Mean295.941114
Median Absolute Deviation (MAD)213.22581
Skewness5.819756086
Sum36549911.34
Variance1357762.853
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
264.55449 252 0.2%
 
536.81781 252 0.2%
 
781.52293 252 0.2%
 
19.18525 252 0.2%
 
880.91336 252 0.2%
 
-801.68175 252 0.2%
 
-369.11925 252 0.2%
 
-12.55974 252 0.2%
 
-147.50749 252 0.2%
 
-3.07013 252 0.2%
 
Other values (1512) 120984 98.0%
 
ValueCountFrequency (%) 
-12349.038 48 < 0.1%
 
-8872.9722 24 < 0.1%
 
-6819.1658 48 < 0.1%
 
-5903.3431 12 < 0.1%
 
-5561.21761 12 < 0.1%
 
ValueCountFrequency (%) 
22384.4284 48 < 0.1%
 
18920.3288 60 < 0.1%
 
12900.6171 72 0.1%
 
9830.8583 18 < 0.1%
 
8946.1945 120 0.1%
 

real_gdp
Real number (ℝ≥0)

HIGH CORRELATION
Distinct count1522
Unique (%)1.2%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean65780.75677833655
Minimum7411.675016
Maximum367573.8026
Zeros0
Zeros (%)0.0%
Memory size965.0 KiB

Quantile statistics

Minimum7411.675016
5-th percentile10063.97722
Q131695.66442
median43621.26281
Q377821.81543
95-th percentile169763.5913
Maximum367573.8026
Range360162.1276
Interquartile range (IQR)46126.15101

Descriptive statistics

Standard deviation63694.51972
Coefficient of variation (CV)0.9682849946
Kurtosis6.985860667
Mean65780.75678
Median Absolute Deviation (MAD)15135.7457
Skewness2.456860035
Sum8124186585
Variance4056991843
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
37382.6956 252 0.2%
 
36790.95507 252 0.2%
 
35944.74247 252 0.2%
 
38793.31593 252 0.2%
 
37483.34686 252 0.2%
 
35561.96251 252 0.2%
 
36607.50875 252 0.2%
 
36298.74146 252 0.2%
 
36491.24528 252 0.2%
 
36198.25301 252 0.2%
 
Other values (1512) 120984 98.0%
 
ValueCountFrequency (%) 
7411.675016 10 < 0.1%
 
7425.190462 4 < 0.1%
 
7441.468034 2 < 0.1%
 
7470.711468 6 < 0.1%
 
7484.861569 12 < 0.1%
 
ValueCountFrequency (%) 
367573.8026 48 < 0.1%
 
366065.3994 48 < 0.1%
 
365188.1085 48 < 0.1%
 
364154.7401 48 < 0.1%
 
361163.3535 48 < 0.1%
 

RepDate
Categorical

HIGH CARDINALITY
HIGH CORRELATION
Distinct count79
Unique (%)0.1%
Missing0
Missing (%)0.0%
Memory size965.0 KiB
12/31/2002
 
1848
6/30/2006
 
1848
6/30/2003
 
1848
12/31/2005
 
1848
12/31/2003
 
1848
Other values (74)
114264
ValueCountFrequency (%) 
12/31/2002 1848 1.5%
 
6/30/2006 1848 1.5%
 
6/30/2003 1848 1.5%
 
12/31/2005 1848 1.5%
 
12/31/2003 1848 1.5%
 
6/30/2002 1848 1.5%
 
3/31/2005 1848 1.5%
 
3/31/2006 1848 1.5%
 
9/30/2004 1848 1.5%
 
12/31/2001 1848 1.5%
 
Other values (69) 105024 85.0%
 

Length

Max length10
Mean length9.251870385
Min length9
ValueCountFrequency (%) 
Decimal_Number 10 90.9%
 
Other_Punctuation 1 9.1%
 
ValueCountFrequency (%) 
Common 11 100.0%
 
ValueCountFrequency (%) 
ASCII 11 100.0%
 

MovingAverage
Real number (ℝ≥0)

ZEROS
Distinct count37715
Unique (%)30.5%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean0.002081078845583949
Minimum0.0
Maximum0.251820313
Zeros48089
Zeros (%)38.9%
Memory size965.0 KiB

Quantile statistics

Minimum0
5-th percentile0
Q10
median0.000210279
Q30.0017344675
95-th percentile0.009296079
Maximum0.251820313
Range0.251820313
Interquartile range (IQR)0.0017344675

Descriptive statistics

Standard deviation0.006530331535
Coefficient of variation (CV)3.137954888
Kurtosis272.5628123
Mean0.002081078846
Median Absolute Deviation (MAD)0.000210279
Skewness12.57295768
Sum257.0215617
Variance4.264522996e-05
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
0 48089 38.9%
 
0.000125 284 0.2%
 
3.91e-05 230 0.2%
 
2.5e-05 209 0.2%
 
0.0001 205 0.2%
 
0.00015625 198 0.2%
 
7.5e-05 193 0.2%
 
3.13e-05 191 0.2%
 
5e-05 189 0.2%
 
0.00025 181 0.1%
 
Other values (37705) 73535 59.5%
 
ValueCountFrequency (%) 
0 48089 38.9%
 
6.25e-06 41 < 0.1%
 
7.81e-06 41 < 0.1%
 
9.01e-06 41 < 0.1%
 
9.7e-06 40 < 0.1%
 
ValueCountFrequency (%) 
0.251820313 1 < 0.1%
 
0.223474591 1 < 0.1%
 
0.219079934 1 < 0.1%
 
0.219078132 1 < 0.1%
 
0.219070923 1 < 0.1%
 

Target
Boolean

Distinct count2
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size482.6 KiB
0
88260
1
35244
ValueCountFrequency (%) 
0 88260 71.5%
 
1 35244 28.5%
 

P1
Boolean

Distinct count2
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size482.6 KiB
0
112365
1
 
11139
ValueCountFrequency (%) 
0 112365 91.0%
 
1 11139 9.0%
 

P10
Boolean

Distinct count2
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size482.6 KiB
0
113751
1
 
9753
ValueCountFrequency (%) 
0 113751 92.1%
 
1 9753 7.9%
 

P11
Boolean

Distinct count2
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size482.6 KiB
0
113905
1
 
9599
ValueCountFrequency (%) 
0 113905 92.2%
 
1 9599 7.8%
 

P12
Boolean

Distinct count2
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size482.6 KiB
0
114059
1
 
9445
ValueCountFrequency (%) 
0 114059 92.4%
 
1 9445 7.6%
 

P2
Boolean

Distinct count2
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size482.6 KiB
0
112519
1
 
10985
ValueCountFrequency (%) 
0 112519 91.1%
 
1 10985 8.9%
 

P3
Boolean

Distinct count2
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size482.6 KiB
0
112673
1
 
10831
ValueCountFrequency (%) 
0 112673 91.2%
 
1 10831 8.8%
 

P4
Boolean

Distinct count2
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size482.6 KiB
0
112827
1
 
10677
ValueCountFrequency (%) 
0 112827 91.4%
 
1 10677 8.6%
 

P5
Boolean

Distinct count2
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size482.6 KiB
0
112981
1
 
10523
ValueCountFrequency (%) 
0 112981 91.5%
 
1 10523 8.5%
 

P6
Boolean

Distinct count2
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size482.6 KiB
0
113135
1
 
10369
ValueCountFrequency (%) 
0 113135 91.6%
 
1 10369 8.4%
 

P7
Boolean

Distinct count2
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size482.6 KiB
0
113289
1
 
10215
ValueCountFrequency (%) 
0 113289 91.7%
 
1 10215 8.3%
 

P8
Boolean

Distinct count2
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size482.6 KiB
0
113443
1
 
10061
ValueCountFrequency (%) 
0 113443 91.9%
 
1 10061 8.1%
 

P9
Boolean

Distinct count2
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size482.6 KiB
0
113597
1
 
9907
ValueCountFrequency (%) 
0 113597 92.0%
 
1 9907 8.0%
 

Interactions

Correlations

Pearson's r

The Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. It's value lies between -1 and +1, -1 indicating total negative linear correlation, 0 indicating no linear correlation and 1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r.

To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.

Spearman's ρ

The Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better in catching nonlinear monotonic correlations than Pearson's r. It's value lies between -1 and +1, -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation and 1 indicating total positive monotonic correlation.

To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.

Kendall's τ

Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. It's value lies between -1 and +1, -1 indicating total negative correlation, 0 indicating no correlation and 1 indicating total positive correlation.

To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the discordant pairs divided by the total number of pairs.

Phik (φk)

Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency and reverts to the Pearson correlation coefficient in case of a bivariate normal input distribution. There is extensive documentation available here.

Cramér's V (φc)

Cramér's V is an association measure for nominal random variables. The coefficient ranges from 0 to 1, with 0 indicating independence and 1 indicating perfect association. The empirical estimators used for Cramér's V have been proved to be biased, even for large samples. We use a bias-corrected measure that has been proposed by Bergsma in 2013 that can be found here.

Missing values

Sample

First rows

FDICCertSnapshotdatePeriodDatePeriodfoliolossrateTotalAssetsfolioloanStateQuarter_PeriodYear_PeriodMacroMergeKeyfoliolossrateLag1foliolossrateLag2foliolossrateLag3foliolossrateLag4unemploymentunemployment_lag1unemployment_lag6unemployment_lag8unemployment_lag2growthhouse_prices_all_changehouse_purchase_prices_growthhouse_purchase_priceshouse_prices_allCommercialPriceNatCommercialPriceNat_lag8nominal_gdp_lag8nominal_personalincome_lag5changereal_disposableincome_lag3changereal_gdpRepDateMovingAverageTargetP1P10P11P12P2P3P4P5P6P7P8P9
0359/30/200112/31/2001P10.010745129762657AL420012001Q40.00000.00000.00000.00005.8830695.1310724.4867854.756324-0.0022111.3354330.007326147.43151.39665139008.012978329373.70138101.76065-60.2537139073.169549/30/20010.01100000000000
1359/30/20013/31/2002P20.005945129762657AL120022002Q10.01070.00000.00000.00005.9574805.8830694.5593294.7293700.0734160.4684500.007673148.57151.86510139077.012688529367.31942393.61405147.9246939632.225119/30/20010.01000010000000
2359/30/20016/30/2002P30.000945129762657AL220022002Q20.00590.01070.00000.00005.9655675.9574804.4857064.4867850.1278240.9320000.008277149.81152.79710139601.012586929896.46623251.99019538.8559739935.057529/30/20010.01000001000000
3359/30/20019/30/2002P40.012945129762657AL320022002Q30.00090.00590.01070.00005.8142025.9655674.7648824.5593290.0124902.2129000.006301150.76155.01000141168.013906229990.83020305.15501-290.3573540174.308509/30/20010.01000000100000
4359/30/200112/31/2002P50.000045129762657AL420022002Q40.01290.00090.00590.01075.8146645.8142024.7543684.4857060.0013561.5291000.017082153.38156.53910144686.014380130213.04346-9.80572587.9591140307.793499/30/20010.00000000010000
5359/30/20013/31/2003P60.004445129762657AL120032003Q10.00000.01290.00090.00595.7505685.8146645.1310724.764882-0.0260341.7776000.006349154.36158.31670152190.014301130313.83179102.58368282.7523740306.539819/30/20010.01000000001000
6359/30/20016/30/2003P70.000045129762657AL220032003Q20.00440.00000.01290.00096.1203305.7505685.8830694.7543680.0000791.5436000.009942155.91159.86030151098.014193130679.89770152.1019158.2240140654.049729/30/20010.00000000000100
7359/30/20019/30/2003P80.001345129762657AL320032003Q30.00000.00440.00000.01296.1798446.1203305.9574805.131072-0.0111460.9834000.017828158.74160.84370149218.014361430741.98045394.11802209.1585241424.211369/30/20010.01000000000010
8359/30/200112/31/2003P90.001745129762657AL420032003Q40.00130.00000.00440.00006.0316636.1798445.9655675.8830690.0604150.8440000.003077159.23161.68770147309.013900831025.64922182.44981289.8206042128.716259/30/20010.01000000000001
9359/30/20013/31/2004P100.000045129762657AL120042004Q10.00170.00130.00000.00445.9673376.0316635.8142025.9574800.0096301.7543000.007913160.50163.44200153583.013907731480.40015284.61906243.8855243021.157379/30/20010.00010000000000

Last rows

FDICCertSnapshotdatePeriodDatePeriodfoliolossrateTotalAssetsfolioloanStateQuarter_PeriodYear_PeriodMacroMergeKeyfoliolossrateLag1foliolossrateLag2foliolossrateLag3foliolossrateLag4unemploymentunemployment_lag1unemployment_lag6unemployment_lag8unemployment_lag2growthhouse_prices_all_changehouse_purchase_prices_growthhouse_purchase_priceshouse_prices_allCommercialPriceNatCommercialPriceNat_lag8nominal_gdp_lag8nominal_personalincome_lag5changereal_disposableincome_lag3changereal_gdpRepDateMovingAverageTargetP1P10P11P12P2P3P4P5P6P7P8P9
1234948476/30/20209/30/2020P10.0659633460WV320202020Q30.00.00.00.09.08605713.1242874.9024165.0788830.0564142.8636090.015316227.118440234.146209303569.600027781719680.42302-1.32120-96.9752717061.576436/30/20200.0014750100000000000
1234958476/30/202012/31/2020P20.0659633460WV420202020Q40.00.00.00.07.9291249.0860574.6664984.9338670.5952372.7756490.013593230.248107236.921858304098.120027899919698.92991-11.57153121.7555817245.349126/30/20200.0018440000010000000
1234968476/30/20203/31/2021P30.0659633460WV120212021Q10.00.00.00.07.6610057.9291244.9117584.902416-0.4444422.7410080.014098233.540597239.662866303218.344028737019443.56365-50.844302623.1528117331.382686/30/20200.0008300000001000000
1234978476/30/20206/30/2021P40.0659633460WV220212021Q20.00.00.00.07.4240187.6610055.0125424.666498-0.1459092.7610380.013974236.850303242.423904303707.212830092719549.84498211.50576-752.4953017437.821426/30/20200.0010370000000100000
1234988479/30/202012/31/2020P10.0656133747WV420202020Q40.00.00.00.07.9291249.0860574.6664984.9338670.5952372.7756490.013593230.248107236.921858304098.120027899919698.92991-11.57153121.7555817245.349129/30/20200.0014750100000000000
1234998479/30/20203/31/2021P20.0656133747WV120212021Q10.00.00.00.07.6610057.9291244.9117584.902416-0.4444422.7410080.014098233.540597239.662866303218.344028737019443.56365-50.844302623.1528117331.382689/30/20200.0003690000010000000
1235008479/30/20206/30/2021P30.0656133747WV220212021Q20.00.00.00.07.4240187.6610055.0125424.666498-0.1459092.7610380.013974236.850303242.423904303707.212830092719549.84498211.50576-752.4953017437.821429/30/20200.0004610000001000000
12350184712/31/20203/31/2021P10.0676603523WV120212021Q10.00.00.00.07.6610057.9291244.9117584.902416-0.4444422.7410080.014098233.540597239.662866303218.344028737019443.56365-50.844302623.1528117331.3826812/31/20200.0000000100000000000
12350284712/31/20206/30/2021P20.0676603523WV220212021Q20.00.00.00.07.4240187.6610055.0125424.666498-0.1459092.7610380.013974236.850303242.423904303707.212830092719549.84498211.50576-752.4953017437.8214212/31/20200.0000000000010000000
1235038473/31/20216/30/2021P10.0712543643WV220212021Q20.00.00.00.07.4240187.6610055.0125424.666498-0.1459092.7610380.013974236.850303242.423904303707.212830092719549.84498211.50576-752.4953017437.821423/31/20210.0000000100000000000